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ABSTRACT 

We analyze a sample of 23 supermassive elliptical galaxies (central velocity dispersion 
larger than 330 km s -1 ), drawn from the SDSS. For each object, we estimate the 
dynamical mass from the light profile and central velocity dispersion, and compare 
it with the stellar mass derived from stellar population models. We show that these 
galaxies are dominated by luminous matter within the radius for which the velocity 
dispersion is measured. We find that the sizes and stellar masses are tightly correlated, 
with R e oc , making the mean density within the de Vaucouleurs radius a steeply 
declining function of M*: p e oc M~ 2 ' 2 . These scalings are easily derived from the 
virial theorem if one recalls that this sample has essentially fixed (but large) <jq. In 
contrast, the mean density within f kpc is almost independent of M*, at a value that 
is in good agreement with recent studies of z ~ 2 galaxies. The fact that the mass 
within 1 kpc has remained approximately unchanged suggests assembly histories that 
were dominated by minor mergers - but we discuss why this is not the unique way to 
achieve this. Moreover, the total stellar mass of the objects in our sample is typically 
a factor of ~ 5 larger than that in the high redshift [z ~ 2) sample, an amount which 
seems difficult to achieve. If our galaxies are the evolved objects of the recent high 
redshift studies, then we suggest that major mergers were required at z > 1.5, and 
that minor mergers become the dominant growth mechanism for massive galaxies at 
z<1.5. 

Key words: galaxies: formation - galaxies: evolution - galaxies: elliptical and lentic- 
ular, cD - galaxies: high-redshift. 



1 INTRODUCTION 

The study of luminous and dark matter in elliptical galaxies 
is crucial to understanding the formation of massive galax- 
ies in our Universe. In hierarchical models, ellipticals are 
the result of interactions and mergers of spiral galaxies (e.g. 
Blumenthal et al. 1984). This is in contrast to a scenario 
in which they form from a monolithic collapse (e.g. Eggen 
et al. 1962; Granato et al. 2004). The most massive ellipti- 
cal galaxies, with baryonic masses M > 1O 11 M0, are chal- 
lenging because they should have formed in the very early 
universe and at the same time undergone a large deal of 
merging. 

There is now growing evidence that massive galaxies 
(M 4 ~ 10 11 Mq) did exist at z ~ 2. Some work suggests that 
they were much smaller and denser than their local coun- 
terparts of the same stellar mass (e.g. Trujillo et al. 2006; 



van Dokkum et al. 2008; Cimatti et al. 2008; Saracco et al. 
2009) and that similar compact galaxies to those observed 
at high-redshift do not exist in the local universe (e.g. Tru- 
jillo et al. 2009). These results raised the question of what 
process or processes have acted to increase the sizes of these 
objects to make them consistent with the larger sizes we see 
at late times (e.g. van Dokkum et al. 2008; Fan et al. 2008). 

Bezanson et al. (2009) showed that the stellar density 
within the central 1 kpc of ellipticals at z ~ 2.3 is similar 
to that for nearby ellipticals (they differ by only a factor 
of ~ 2, compared to a difference of a factor of ~ 100 if 
the comparison is done within the half-light radius). This 
suggests an inside-out, hierarchical growth scenario domi- 
nated by dry minor mergers which add mass primarily to 
the outer regions (e.g. Loeb & Peebles 2003; Bournaud et 
al. 2007; Naab et al. 2007; Hopkins et al. 2009). 

However, the evidence for small sizes at high-redshift 
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and the lack of such objects at low-redshift is not uncon- 
tested. For example, Mancini et al. (2010) have argued that 
neglect of low surface brightness features will bias r e to small 
values. Their analysis shows that some of the objects at 
2 ~ 1.5 are not small for their M* compared to z — ob- 
jects. A similar result was recently presented by Onodera 
et al. (2010) who found a z — 1.82 analog of local ultra- 
massive elliptical galaxies. Recently, Saracco et al. (2010) 
found from a complete sample of 34 early-type galaxies at 
0.9 < z < 1.92 that 21 of these are similar to the local ones 
even though they are co-eval with more compact early-type 
galaxies. In addition, Valentinuzzi et al. (2010) found that 
about 25% of the objects with M» > 3 x 10 10 M in lo- 
cal clusters are superdense (i.e. they have sizes like those 
observed out to z ~ 2). However, they found that there is 
strong evidence for a large evolution in radius for the most 
massive galaxies, i.e. BCGs (M, > 4 x 10 11 M©). 

Indeed, evolution in the properties of local BCGs was 
detected by Bernardi (2009), who showed that the sizes 
and velocity dispersions of BCGs (and of massive early-type 
galaxies) in the local Universe (z < 0.3) are still evolving. 
This work suggests that minor dry mergers dominate the as- 
sembling of BCGs at lower redshifts since the observed size 
evolution is more rapid than expected by major dry merg- 
ers once one accounts for the small changes in the observed 
luminosity/mass functions (~ 50% since z < 1; e.g. Wake 
et al. 2006; Brown et al. 2007; Cool et al. 2008). Minor dry 
mergers are better able to reconcile the observations of size 
evolution with little mass evolution. In addition, Bernardi 
(2009) also claims that minor mergers can also help to rec- 
oncile the substantial growth (~ a factor of 2) predicted for 
the dark matter halos since z ~ 1 (e.g. Sheth & Tormen 
1999) with the little stellar mass evolution in the central 
galaxies (i.e. ~ 50%): indeed, the fractional mass growth of 
BCGs need not be the same as that of their host clusters - 
some of the added stellar mass must make the intercluster 
light. 

However, while an inside-out, hierarchical growth sce- 
nario dominated by minor dry mergers can describe the as- 
sembling of BCGs at low redshift, the recent analysis of 
Bernardi et al. (2010b) suggests that some of the features 
observed in the scaling relations of massive early-type galax- 
ies at M* > 2 x 10 1 Mq (e.g. the upwards curvature in 
the color— M* relation, the decrease in the mean axis ratio 
and color gradients and the fact that most scaling relations 
with a are well-described by a single power law) can only be 
explained by an assembly history dominated by major dry 
mergers above this mass. 

The main goal of this paper is to use both visible stellar 
masses and dynamical stellar masses, for 23 supermassive 
elliptical galaxies identified by Bernardi et al. (2006, 2008), 
to investigate these issues in the light of previous work. 

The paper is organized as follows. We describe the ob- 
servables in Section[2l our procedure for estimating the total 
dynamical masses, and the fraction which is in stars, in Sec- 
tion[3l a comparison of these dynamical mass estimates with 
those from stellar population models in Section [4] and scal- 
ing relations between size and mass in Section [5] A final 
section discusses our findings: we argue that it is not clear 
that minor mergers since z ~ 2 can have been the dominant 
formation mechanism of these massive galaxies. 

When necessary, we assume a flat background cosmol- 



Table 1. Properties of the 23 supermassive elliptical galaxies of 
our sample. Galaxies are identified using the same index, col- 
umn(l), as in Hyde et al. (2008). Column (2) gives the velocity 
dispersion within the 3 arcsec SDSS fiber; Column (3) the pro- 
jected half-light radius from fitting to a de Vaucouleurs profile 
in the i-band; Column (4-5) the projected half-light radius and 
Sersic index from fitting to a Sersic profile in the i-band; and Col- 
umn (6), the physical scale on which the velocity dispersion was 
measured (corresponding to 3 arcsec). 
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ogy that is dominated by a cosmological constant at the 
present time: fio = 0.3, Ao = 1 — fio, and we set Hubble's 
constant to Ho = 70 km s~ Mpc~ . 



2 OBSERVATIONAL PROPERTIES OF GIANT 
ELLIPTICALS 

We use the sample of 23, z < 0.3, supermassive elliptical 
galaxies, selected by Bernardi et al. (2006, 2008) from the 
SDSS database on the basis of their large velocity disper- 
sions: a >330 km s _1 . Surface brightness fits and analyses 
of the SDSS and HST-based light profiles are presented in 
Hyde et al. (2008). An overview of the galaxy properties 
that we need for this study is gathered in Table [TJ 

For all the objects in our sample, the observed surface 
brightness distribution was fit to de Vaucouleurs (DeV) and 
Sersic (S) profiles: 
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where 7o and r e are the central surface brightness and the 
projected half-light radius (r e is given Table [1]). (We use the 
parameters provided by Hyde et al. 2008, rather than those 



The inner structure of massive elliptical galaxies 3 



output from the SDSS database. Hyde et al. also provide fits 
to Bulge/Disk decompositions, which we do not use here.) 
We use 7 to denote the mass-to-light ratio, and we define the 
surface mass density E(-R) = jI(R), then the deprojected 
density, computed by inverting the Abel equation, is 



riS(fl) dR 

dR yfR2~ 



The mass within (a sphere of radius) 
M*(<r)=47r / dxx 2 p*(x). 



(3) 



(4) 



Note that we could have defined the analogous quantities 
for the light, then p*(r) = 7Pl(t*) and M*(< r) = 7 L(< 
r). The quantity of most interest in this paper is the total 
stellar mass: M t = M*(oo) = 7_L(oo). We describe how we 
estimate it in the next section. 

The other observed quantity is the average velocity dis- 
persion Co within the SDSS fiber, which has a diameter 
(2r ap ) of 3 arcsec, that corresponds to about 5-10 kpc ac- 
cording to the distance of our galaxies. (Note that here we do 
not use a aperture corrected to r e /8 reported by Bernardi 
et al. 2006, 2008.) The value of 00 is related to the line- 
of-sight velocity dispersion of the object, weighted by the 
surface-brightness profile: 
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where L(r ap ) — 2ty J^ ap I(R) RdR and af os (R) is the veloc- 
ity dispersion at projected distance R from the center. 

The quantity 00 is related to the gravitational potential, 
and hence to the total mass of the system, as follows. 



3 JEANS' EQUATION ANALYSIS 

In what follows, we provide an estimate of the total mass, 
and the fraction of this which is in stars, for the objects in 
our sample. Our analysis assumes that the objects - both 
the stellar and the dark matter components - are spherical, 
with constant stellar mass-to- light ratios, and no anisotropic 
velocities. This is extremely idealized: Bernardi et al. (2008) 
have argued that many of these objects are likely to be pro- 
late objects viewed along the long axis. We discuss this more 
complex case in an Appendix, but since none of our conclu- 
sions are sensitive to this, we have kept the simpler (spheri- 
cal, isotropic) model in the main body of the text. Bernardi 
et al. also argue that some of these objects may be rotating 
- an effect we do not include in our analysis. We have not 
removed such objects from our analysis, since it is interest- 
ing that they appear to show similar scalings as the objects 
which are not rotating. 



3.1 Spherical symmetry and isotropic velocities 

The 1-D Jeans equation in spherical symmetry (Binney & 
Tremaine 1987) relates the radial velocity dispersion oy(r) 
to the mass distribution: 

y +2 P (r) P ^ r) = - P *(r) GM{ f r \ (6) 
dr r r 2 



where p* is the density of the stellar component at r, M(< r) 
is the total mass (M* + Mdm) within r and /3 is the 
anisotropy profile. Studies of nearby elliptical galaxies in- 
dicate that the tangential anisotropy within the half-light 
radius is negligible (e.g. Matthias & Gerhard 1999; Gerhard 
et al. 2001; Koopmans & Treu 2003; Koopmans et al. 2009). 
Therefore we set ft — in what follows - the Appendix shows 
how our results are modified if ft ^ 0. Thus, equation ([6]) 
implies 
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An observer only measures the projection along the line of 
sight, (Tios{r), which is given by 
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Inserting equation (J7J in Q, and this in ([5| shows how the 
total dynamical mass is related to the observed light profile 
and <7o. 

3.2 Insignificance of dark matter in 00 

In what follows, we estimate the total mass within some 
radius r as the sum of the stellar mass (obtained from the 
observed light profile with the assumption of constant mass- 
to-light ratio) and that of the dark matter, for which we use 
the fitting formula of Navarro et al. (1996; hereafter NFW). 

For an NFW profile with total mass M v i r , the virial 
radius is 



(9) 



(10) 



kp^ = ^lO^M© 

The mass within some r < r v i r is given by 
log(l +x)-r£r 

M DM {< x) = Mvir , j- , i±2-, 

log(l + c) - 

where x — r/r s = cr/r vir : r s is a characteristic scale length 
for the halo, and the final equality defines the concentration 
parameter 

Note that massive halos are less concentrated. For small x, 

Mbm{<x) x 2 /2 



log(l + c) 



(12) 



l + c 



for c = 9.3, this is x 2 /2.86. 

We model the total mass of each galaxy in our sample as 
a superposition of the deprojected de Vaucouleurs or Sersic 
profile having total mass M* and an NFW halo with a total 
mass M v i r . This represents a compromise: adiabatic contrac- 
tion arguments suggest that the dark matter should become 
more centrally concentrated than a pure NFW profile, as 
the gas which formed the stellar component shrinks towards 
the center - we are ignoring this effect (see Padmanabhan 
et al. 2004; Schulz et al. 2010; Treu et al. 2010 for recent 
analyses of other samples in which this effect is included). 
On the other hand, observations suggest that galaxies have 
a cored rather than a cuspy halo (e.g. Salucci et al. 2007) - 
even an uncontracted NFW profile is too steep. 
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Figure 1. Left: Radial velocity dispersion velocity profile for a massive elliptical galaxy having stellar mass M, = 10 Mq surrounded 
by a dark matter halo of mass M v i r = 30M*. The radial distributions of these two components are described in the main text. The 
dotted line (green) represents the dark matter component, the dashed line (red) the stellar component (for a de Vaucouleurs profile), 
and the solid line (blue) the total profile. For such objects, the dark matter does not contribute significantly within the first kpc. Right: 
Central velocity dispersion measured within a fiber of projected radius 7 kpc (ctP m ), for the dark matter component only, as M v i r is 
increased. 



Table 2. Results of the modelling. Contribution of the dark mat- 
ter component on the total dispersion velocity (column 2). Dy- 
namical mass computed by solving the Jeans equation (column 
3-4). Photometric stellar mass using the model of Maraston (col- 
umn 5-6). 
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Notice that big galaxies are likely to have some scatter 
around this number (i.e. 30). In fact for smaller values, our 
conclusion that the stars dominate the mass will be stronger. 
We demonstrate that the mass within r ap is dominated 
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Figure 2. Constant of proportionality A between M, and r ap a^ 
(equation 1161) , where M* is computed by solving the Jeans equa- 
tion (see text). 



by the stars, 

V s — T v i r jc ~ 



not the dark matter. For M vi . 



10 13 M Q , 



60 kpc, and the NFW mass within r ap is ap- 



To proceed further, we assume that : 
M viT = 30M*, 

following Shankar et al. (2006). This makes 



proximately 30M* (r ap /r s ) /2.86, whereas the stellar mass 
within r ap is slightly less than M*/2. Since r ap /r s <C 1, 
the total mass within r ap is dominated by the stellar mass. 
Consequently, a a is also dominated by the stellar mass. 

We show this explicitly in Figure [1] The left hand panel 
shows CTi os (r) , obtained from equation [7] [S] for an elliptical 
galaxy with Af* = 1O 12 M0 and r e = 6 kpc. We see that 
the contribution from the dark matter component is always 
much smaller than that of the stellar component (a®^ 1 <C 

a los)- 

Then we show that a$ M is always negligible compared 
to (To observed (right hand panel). Since ctq = ctq 1 + a^ >M 2 , 
within our assumptions, the measured value of cjq provides 
a good estimate of Af, , without being contaminated by the 
dark matter component. The effect of DM is small, we can 
evaluate it and correct the estimation of ctq by assuming 
equation 1131 On average the dark matter component con- 
tributes to less than 5% to the total velocity dispersion (Ta- 
ble 0. 

Now we build a new stellar mass estimator that exploits 
the quantity ero. The Jeans equation implies that the quan- 
tity r ap aQ should be proportional to a function of r ap /r e . If 
we set 



M* = \(r ap al), 



(15) 
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Figure 3. Top: Dynamical mass versus luminosity in the i-band. 
Bottom: stellar mass- to-light ratio (computed from the dynamical 
mass) versus color g-r. 



the actual value of A plotted in Figure [2] can be fitted by: 



A = 4.5 



(16) 



As a check, note that if the power-law above had slope — 1, 
then M* oc r R a^\ this is the scaling that is usually assumed. 
The zero-point of the M* oc r e a% relation, 4.5, is close to that 
commonly assumed for a pure de Vaucouleurs profile (e.g. 
Bernardi et al. 2010a). It is also close to that derived from 
analyses which assume adiabatic contraction (Padmanabhan 
et al. 2004). 

The top panel of Figure [3] shows the dynamical stellar 
mass computed as described above, versus the luminosity 
in the i-band. The bottom panel shows the corresponding 
mass-to- light ratio versus color (g-r). 



4 COMPARISON WITH STELLAR MASS 
ESTIMATES FROM PHOTOMETRY 

We estimate the stellar masses M* p , from the stellar popu- 
lation (hereafter SP) model of Maraston et al. (2009) (avail- 
able at www.maraston.eu). This is a two-component model 
made by a major metal-rich population and a low percent- 
age (3%) of an old and metal-poor population with the same 
age. Moreover, an improvement in the spectra of K-giants 
is included through the empirical stellar spectra of Pick- 
les (1998). The initial mass function in the Maraston et al. 
(2009) paper was a Salpeter (1955) and we did not mod- 
ify this assumption here. We use these models because they 



trace the color evolution of Luminous Red Galaxies (LRGs) 
in SDSS from redshift 0.1 to 0.6 much better than previous 
attempts did (see Maraston et al. 2009). 

We obtain the stellar masses through Spectral Energy 
Distribution (SED) fitting, as in Maraston et al. (2006), 
e.g., by using the Maraston et al. 2009 templates in the 
Hyper-Z code (Bolzonella et al. 20000. We computed the 
stellar masses using both de Vaucouleurs and Sersic mag- 
nitudes. The first set were obtained by fitting the ugriz de 
Vaucouleurs magnitudes available from the SDSS database, 
while the second set was computed rescaling the SDSS de 
Vaucouleurs magnitudes by the difference between the i- 
band Sersic and de Vaucouleurs magnitude observed by 
Hyde et al. (2008). This is a good approximation since color 
gradients are small. 

The panel on the left of Figure [4] shows our M* yn esti- 
mates versus those from the stellar population model. Note 
that M? yn = Mf F , the stellar masses as obtained with 
these composite models are in agreement with the dynamical 
masses. 



5 STRUCTURAL PROPERTIES OF GIANT 
ELLIPTICALS 

We now analyze the correlation between size and mass in 
our objects. The left hand panel of Figure [5] shows the cor- 
relation between r e and M* using our dynamical estimates 
of M*. The solid line shows the direct fit (i.e. (r e \M*)) 



kpc 



7.2 



1O 12 M 



(17) 



This relation is significantly steeper than the (r e \M t ) oc 
M*' 6 that is usually reported (e.g. Hyde & Bernardi 2009). 
However, if we recall that these objects all have large a, then 
the relevant comparison is to the relation at fixed M, and a. 
This slope is close to 0.9 (Bernardi et al. 2008). In this case, 
it is plausible that our (now only slightly) steeper slope is 
due to the fact that our M* estimator is less noisy. 

In fact, the small scatter in our relation can also 
be understood in these terms. Equations (|15[) and (|16l) 
show that our dynamical estimate of M, is proportional 



to (r e /r av ) 



= (r« P /r B )° 



Figure [5] shows that 



there is little scatter around this relation. However, our sam- 
ple has essentially fixed a, and r ap /r e has only a small scat- 
ter, so Figure [5] is almost a plot of r e vs r e , with the zero- 
point being set by the mean a and r ap /r e values in the sam- 
ple. This is why the slope is close to unity. Of course, if the 
stellar distribution were not homologous, or the dark matter 
was a significant fraction of the total mass, then r ap /r e need 
not have small scatter. 

The diamonds, squares, triangles show these same re- 
lations, but now obtained from a sample of z ~ 2.3 objects 
by Bezanson et al. (2009), Damjanov et al. (2009), Mancini 
et al. (2010) and Longhetti et al. (2007), respectively. A 



1 We checked that had we fitted a composite model with identical 
characteristics, but obtained with the Maraston (2005) SP mod- 
els, i.e. the standard models based on the Kurucz (1979) model 
atmospheres, we would have obtained the same results. 
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Figure 4. Le/i: Dynamical masses from the Jeans equation versus stellar masses from the composite stellar population models by 
Maraston et al. (2009). Dashed line shows M* yn = Mf p . Right: Ratio of M* yn / M * P as a function of M dyn . 




Stellar Mass [M s ] Stellar Mass [M e ] 

Figure 5. Le/i: Correlation between the characteristic radius, re, and the stellar mass M». The black circles correspond to our sample 
(where the mass is estimated from the Jeans equation analysis). Open and filled symbols show the objects modelled using a Sersic or de 
Vaucouleurs profile, respectively. The dotted line shows the relation for the full sample of early-types from Hyde & Bernardi (2009) and 
the dashed line is from Shen et al (2003). Right: Correlation between the mean stellar density within r e and M„. 



number of studies have noted that the z ~ 2 objects are sig- 
nificantly smaller than z ~ objects of the same M» (dotted 
or dashed curves). Note that Bezanson et al., Mancini et al., 
Longhetti et al. and Damjanov et al. computed their stel- 
lar masses with different IMF (Kroupa, Chabrier, Salpeter 
and Baldry & Glazebrook) and different stellar population 
models (Bruzual & Chariot 2003 and Maraston 2005). The 
fit from Hyde & Bernardi (2009) and Shen et al. (2003) 
were computed using a Chabrier IMF. In order to com- 
pare these different samples we recalibrated the masses to a 
Salpeter IMF (as used in Maraston et al. 2009) using these 
scale-factors: M? alpeter = L6M? roupa = 1.78Aff habriBr = 



2M" . In addition, to account for differences in stellar 
population models we rescaled the Bezanson et al. data mul- 
tiplying their stellar masses by 0.7 (as e.g. in Mancini et al. 
2010, see also Muzzin et al. 2009) and by 0.5 for the Dam- 
janov et al. stellar masses. 

The evidence for small sizes at z ~ 2 is not uncon- 
tested. Mancini et al. (2010) have argued that neglect of 
low surface brightness features will bias r e to small values 
(while the bias in M* is small - C. Mancini private commu- 
nication). Accounting for this effect in a sample at z ~ 1.5 
yields the open squares in Figure [5] Evidently, these objects 
are not small for their M* compared to z = objects. If 
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Figure 6. Left: Stellar mass density within r e versus the stellar mass density within 1 kpc. Right: Correlation between stellar mass 
density within 1 kpc and the total stellar mass. In both panels, open symbols are from a Sersic profile, while filled symbols are from a 
de Vaucouleurs profile. 



both Mancini et al. and Bezanson et al. are correct, and 
both probe the massive end of the population at their re- 
spective mean redshifts, then there must have been signifi- 
cant evolution in size and stellar mass between z = 2.3 and 
z = 1.5. 

In view of this discussion, it is remarkable that the 
z — 2.3 compact galaxies appear to trace the small size 
and M* end of the relation we find at z ~ 0, for fixed a 
(solid line). The z ~ 1.5 sample of Mancini et al. (2010) 
is confined to a narrower range of M*, making it difficult 
to define a relation. However, at this M*, the difference be- 
tween the solid line (a ~ 400 km s _1 ) and the others (bulk 
of early-type population) is a factor of two or less. It will be 
interesting if future measurements show the z — 2.3 objects 
to have a ~ 400 km s _1 , and even more so if this is also 
true for the most compact of the objects in the Mancini et 
al. sample. 

We now turn to a slightly different version of the corre- 
lation between size and mass, namely that between average 
density and mass. For this purpose, it is useful to define 



3 

Pr = ~ 



(18) 



This characteristic density is a sharply declining func- 
tion of stellar mass - the decline is significantly steeper 
than previously reported for a sample which includes the 
full range of early-types (e.g. Bernardi et al. 2003; Hyde & 
Bernardi 2009). Following our discussion of the r e — M* re- 
lation above, the more relevant comparison may be with 
the p e — M* relation at fixed a, for which the slope is 
— 1.8 (Bernardi et al. 2008). Our current estimate is slightly 
steeper, perhaps because our mass estimate is less noisy. To 
see this, note that we could have derived the slope from the 
fact that p e oc M*/rl oc M^ 2 ' 21 , where the final expression 
uses the fact that the scatter between r e and M* is small 
(which we argued was a consequence of the fact that our 
sample has only a small range of a, a small range in r c /r ap , 
and that Figure [2] has small scatter). We note that, while 
the 2 = 2.3 objects have p e orders of magnitude larger than 
the bulk of the 2 = objects of the same M* (compare dia- 
monds with dashed or dotted curves), they are only slightly 
denser than z = objects of the same M*, if such objects 
had a ~ 400 km s" 1 (extrapolate solid line to small M»). 

The steepness of this relation stands in stark contrast 
to the relation between pi and M* . The right-hand panel of 
Figure [5] shows that this relation is very shallow. We find 



the average density within r. In what follows, we will pay 
special attention to p e and pi: the mean density within the 
de Vaucouleurs radius r e , and within 1 kpc, respectively. 
Whereas p e can be thought of as a characteristic density, pi 
is more like the central density (recall that, for this sample, 
r e « 10 kpc). 

The right-hand panel of Figure shows the correlation 
between p e and M* , using the same format as for the panel 
on the left. Fitting to the relation defined by our dynamically 
estimated M» yields 



Pi 



Mq kpc 



1.8 x 10 1 



1O 12 M 
Mi, 



0.25±0.05 



(20) 



While the mass varies by 2 orders of magnitude (from 10 11 
Mq to 5 x 10 12 Mq), the central density remains constant 
at about 1.8 x 10 10 Mq kpc -3 . 

The left-hand panel of Figure [6] shows another way of 
presenting this information: while p e varies by 3 orders of 
magnitude, pi varies by less than a factor of 2. We find 



Pe 

M e kpc" 



2.3 x 10 s 



lO^M, 



2.22±0.04 



(19) 



Pi 

Mq kpc" 



1.3 x 10 a 



Pe 

Mq kpc" 



(21) 
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Thus the central density is approximately constant for all 
the galaxies of our sample. 

It is quite remarkable that they show the same relation 
as in the samples of Mancini et al. and Longhetti et al. at 
z ~ 1.5 as well as of Bezanson et al. (2009) at z ~ 2.3. 
Although p e is orders of magnitude larger in the z ~ 2.3 
sample than at z — 0, pi is different by only a factor of two 
or so. 



6 DISCUSSION AND CONCLUSIONS 

We analyzed a sample of 23 giant elliptical galaxies. They 
are very massive (M > 10 1 Mq) with large central velocity 
dispersions > 330 km s _1 , which suggests old stellar popula- 
tions (Bernardi et al. 2005) . We estimated dynamical masses 
for each of these systems by using the observed light pro- 
files and central velocity dispersions, by solving the Jeans 
Equation, under the assumptions of spherical symmetry, no 
tangential velocity dispersions, no radial dependence of the 
stellar mass-to-light ratio, and that each system has 30 times 
more dark matter than stellar matter within the virial ra- 
dius, with the dark matter following an NFW profile (i.e. no 
accounting for adiabatic contraction). 

We found that the total stellar masses of these systems 
vary from about lO n Af to 5 x 10 12 M e (Table©. In such 
models, the contribution of the dark matter modifies the 
central velocity dispersion by less than 5% (Figure©. Thus, 
for these objects, the observed oo provides a good estima- 
tor of the luminous mass (equation I16|l . We compared the 
masses we derive to estimates of the stellar mass from stellar 
population models (Figure©. We find good agreement us- 
ing a composite model with high age and a small (by mass) 
metal-poor sub-component. This model fits well the colors 
of Luminous Red Galaxies in SDSS. A major result of this 
study is that we compute the mass-to-light ratio for massive 
elliptical galaxies. This ratio is roughly constant for all the 
sample (Figure©, we find M*/L ~ 5 ± 1 in the i-band, for 
0.7 < g - r < 0.9. 

We also studied different ways of presenting the cor- 
relation between size and mass (Figure © . The correlation 
we find, r e oc M*' or , is slightly steeper than that reported 
by Bernardi et al. (2008). We suggest that this is because 
our estimate of M* is less noisy. For similar reasons, we find 
that the density within r e is a more strongly decreasing func- 
tion of M* than reported by Bernardi et al. (2008) . We find 
p e oc M*/r 3 e oc M~ 2,2 compared to their —1.8. 

A notable result is that galaxies at z ~ 2.3 (Bezanson 
et al.) and z ~ 1.5 (Mancini et al.) appear to follow the 
same relation that we find at z ~ 0. If each sample correctly 
gather the most massive galaxies for each range of redshift, 
the evolution between the size and the stellar mass should 
be meaningful. 

The small scatter associated with our dynamical esti- 
mator of M» means that, in the space of r e , p e and M*, the 
objects in our sample trace out a one-dimensional curve (al- 
though we have argued, as did Bernardi et al. 2008, that the 
r e oc M* scaling we find is consistent with the simplest virial 
theorem scaling, once we account for the fact that these ob- 
jects have essentially fixed a). This is also true in the space 
of r e , a and M t . We show this explicitly in Figure [7] Had 
we used the stellar population-based estimate of M t , these 



curves would have been broadened into a plane. Since the 
(r e , (Jo, M*) projection is similar to that of the Fundamental 
Plane, our results suggest that scatter in the relation be- 
tween L and M», or uncertainties in estimating M* serve to 
enhance the impression of a plane rather than a curve. 

On the other hand, some of the decreased scatter in our 
analysis is due to our neglect of anisotropic velocity disper- 
sion profiles. We explore this in the Appendix. Nevertheless, 
it is likely that the scaling relation between r e and M* of gi- 
ant ellipticals is significantly steeper than for spirals (which 
have r e oc Ml^ 2 ). Understanding why is a challenge for mod- 
els in which ellipticals form from mergers of spirals. 

Although p e decreases strongly with M*, the average 
density on smaller scales (we chose 1 kpc) is almost inde- 
pendent of M» (Figure©. Moreover, it is remarkably similar 
to that found by Bezanson et al. (2009), in their analysis of 
z ~ 2.3 galaxies. The mean density within 1 kpc seems to 
be independent of the redshift and of the mass on an object 
by object basis. I.e., since z ~ 2, as these galaxies grew in 
mass, the mass in the inner kpc remained unchanged. Un- 
derstanding why is an interesting challenge. 

Although this is most easily accomplished in models 
where the mass is added to the outer regions only (e.g. Lapi 
& Cavaliere 2009, Cook et al 2009) - so it is tempting to con- 
clude that minor mergers were the dominant growth mode 
since z ~ 2 (e.g. Bezanson et al. 2009) - there is a direct 
counterexample to this conclusion in the literature. In nu- 
merical simulations of hierarchical structure formation, Gao 
et al. (2004) find that although the mass in the central re- 
gions of what becomes a massive cluster at z = has re- 
mained constant since z ~ 6, the particles which make up 
this mass changed dramatically as the objects assembled. 
This assembly occurred through a sequence of major merg- 
ers at z > 1; with minor mergers beginning to dominate the 
mass growth only at z < 1. Note that in hierarchical mod- 
els, what is true for cluster mass halos is also true for galaxy 
mass halos. While gastrophysics may complicate the discus- 
sion, we raise this as an example where mass growth due 
to major mergers does not lead to increased density in the 
central regions. This appears to be in remarkable agreement 
with what we see. If the z ~ 2 objects studied by Bezanson et 
al. (2009) are to evolve into the objects in our sample, then 
the required mass growth is about a factor of 5 (Figure © 
- this is larger than most minor merger models can accom- 
modate. It may well be that major mergers were required 
at z > 1.5 and that minor mergers become the dominant 
growth mechanism for massive galaxies only at lower red- 
shift (Bernardi 2009; Bernardi et al. 2010b). 

Our Figure [5] supports such a picture: major mergers 
would move the z = 2.3 objects approximately parallel to 
the solid lines in the two panels. A factor of 5 change in mass 
and size would bring them into much better agreement with 
the dotted and dashed z = relations in the panel on the 
left, but they would still lie slightly above the correspond- 
ing line in the panel on the right. Subsequent minor mergers 
would increase the sizes and decrease the velocity disper- 
sions, bringing both the sizes and densities into even better 
agreement. (Note that a small fractional increase in mass 
results in a larger fractional increase in size and an even 
larger fractional decrease in density. Indeed, because den- 
sity is proportional to (cr/r e ) 2 , minor mergers are a great 
way to decrease the density for a modest change in mass.) 
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Figure 7. Left: Relation between the stellar mass M*, the characteristic density p e , and the de Vaucouleurs radius r e . Right: Relation 
between M*, <xo and r e . 



If the high redshift objects indeed have a ~ 400 km s~ 
(as our Figure [5] may suggest), the decrease in a associated 
with minor mergers will (in fact, may be required to) bring 
the number density of large a objects into better agreement 
with that seen locally (Sheth et al. 2003). 

On the other hand, selection effects (e.g. due to the 
small volume observed) could limit the detection of these 
very massive galaxies. For example, the sample of ultramas- 
sive early-type galaxies of Mancini et al. (2010) is selected 
from a 2 deg 2 field. In such a volume, if galaxies of M* ~ 10 12 
Mq are yet present at z > 1.5, as predicted by model like 
Fan et al. (2010), the number of detections should be around 
unity and therefore not necessary detected. The detection of 
these massive objects (already difficult in the local universe) 
is thus a challenge for the high redshift universe. 
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APPENDIX A: PROLATE GALAXIES AND 
ANISOTROPY 

Bernardi et al. (2008) argue that the most massive of these 
objects are likely to be prolate, viewed along the long 
axis (see alse Thomas et al. 2007). If the shape is due to 
anisotropic velocities, then the velocity dispersion projected 
along the line of sight is 

2 2 r ^(B)4B) 

2r 2 [°° 0(R)P*(RWr(R) dR 
E(r) J r RV& ~ r> 

(Al) 

(e.g. Binney & Mamon, 1982), where /3(r) = 1 — a$/(2(T r ) is 
the anisotropy profile. When P — s> — oo the orbits are purely 
circular, while p — > 1 corresponds to radial orbits. 

We have tried different values of /3 from to 0.5, and 
found that the estimated dynamical mass decreases as P in- 
creases. If ft = 0.5 the dynamical estimate of M* is smaller 
by 0.6 dex, but the assumption that all the galaxies of the 
sample have /3 = 0.5 is inconsistent with most studies which 
favor p — (e.g. Thomas et al. 2009). Figure [51 which has 
the same format as Figure [3] in the main text, shows re- 
sults for P = 0.2. The dynamically estimated value of M» is 
slightly smaller compared to Figure [4] 
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